LESS (PRECISION) IS MORE (INFORMATION): QUANTUM 
INFORMATION IN FUZZY PROBABILITY THEORY 



P. BUSCH 

A comparison of structural features of quantum and classical physical theories, 
such as the information capacity of systems subject to these theories, requires a 
common formal framework for the presentation of corresponding concepts (such as 
states, obscrvables, probability, entropy). Such a framework is provided by the no- 
tion of statistical model developed in the convexity approach to statistical physical 
theories. Here we use statistical models to classify and survey all possible types 
of embedding and extension of quantum probabilistic theories subject to certain 
reasonable constraints. It will be shown that the so-called canonical classical ex- 
tension of quantum mechanics is essentially the only 'good' representation of the 
quantum statistical model in a classical framework. All quantum observables are 
thus identified as fuzzy classical random variables. 



1 Introduction 

Every physical theory can be formulated as a probabilistic theory: each type 
of physical system is characterized through a set of observables, representing 
the possible measurements that can be performed on the system, with an 
associated set of states, assigning probabilities to the measurement outcomes. 
The duality of states and observables is formalized in the notion of a statistical 
duality, also called convexity model or statistical model, of which quantum 
mechanics and classical statistical mechanics are particular realizations. This 
structure has been used as the starting point for a reconstruction of quantum 
mechanics as a statistical physical theory in fundamental studies in the 1960s- 

70z mmm\ 

In the past decade, statistical models have been the basis for a series of 
very original and penetrating investigations by the late Slawek Bugajski and 
his collaborators (Enrico Beltrametti and Werner Stulpe). They studied a 
particular classical representation of quantum mechanical density operators 
and observables in terms of probability measures and functions over a 'phase 
space' which is identified with the set of pure quantum states. This represen- 
tation was first formulated by B. Misra ^ anc [ others in the 1970s, but its 
significance as 'the' canonical classical extension of quantum mechanics was 
recognized by S. Bugajski d. E. Beltrametti and S. Bugajski exhibited some 
intriguing features of the canonical classical extension,notably among them 
the so-called Bell phenomenon in a classical framework l«l and the fact that all 
quantum observables (positive operator valued measures, or POVMs), whether 
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sharp (projection valued measures, PVMs) or not, are represented classically 
as fuzzy random variables. These observations led to the inception and de- 
velopment of a generalized classical probability theory, referred to as fuzzy 
probability theory 

EMU! 

This line of research was a demonstration of Slawek Bugajski's creativity as 
a researcher in mathematical physics and the foundations of quantum mechan- 
ics. His late work, abruptly cut short by his untimely death in March 2003, 
comprises a legacy full of intriguing ideas and questions for future research 
waiting to be carried further 1*1 Slawek had hoped to demonstrate that the 
formalisms of statistical models and fuzzy probability theory are useful tools 
for investigations in quantum information science, particularly in quantum 
computation. The aim of the present contribution is to give a brief exposition 
of this approach to quantum mechanics and to help build an intuitive under- 
standing of the intriguing features referred to above. I will review the concept 
of statistical models, recall the quantum and classical examples, and provide 
a comprehensive account of all possible mutual embedding schemes. I will 
show that the canonical classical extension of quantum mechanics is the only 
'good' classical representation of quantum mechanics as a statistical model. 
Since quantum observables are in this description represented as fuzzy clas- 
sical random variables, it follows that the enhanced information storage and 
cryptographic capacities of quantum systems must be seen as a consequence 
of the fact that the set of quantum observables appears as a restriction of the 
set of all classical random variables to a subset of fuzzy random variables. 

The title phrase, "Less is more...", refers to the fact that quantum un- 
certainty can be utilized as resources for information processing purposes in 
the broadest sense. For example, noncommuting pairs of observables can be 
measured jointly if one allows for an appropriate pay-off between the degrees of 
unsharpness in the respective marginal measurements, quantified by suitable 
uncertainty relations. Another example is given by the existence of informa- 
tionally complete POVMs, which are necessarily unsharp observables. Further, 
it has been observed that certain state discrimination procedures are optimally 
performed by POVMs which are not PVMs. Finally, measurements which are 
less precise may be designed so as to be less invasive, thus allowing more 
control over the observed system. Incidentally, Th. Konrad chose the same 
motto for his doctoral thesis on unsharp quantum measurements ESI While 
our choices were made independently of each other, we may both have b een 
(consciously or subconsciously) influenced by H. Dehmelt's famous review El 
which was written in the same spirit. 

The reader may wish to read the pres ent article in conjunction with the 
related contributions by Chris Fuchs LL^I and Guido Bacchiagaluppi in this 
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volume. The former uses the afhne classical embedding of quantum states 
induced by an informationally complete POVM to explore aspects of the con- 
cept of quantum information. The latter investigates the description of state 
changes in the framework of the Misra-Bugajski canonical classical extension. 
This work is dedicated to the memory of Slawek Bugajski. 

2 Statistical models 

Every type of physical system can be prepared and observed in a variety of 
ways. In order to exhibit regularities in the observation data, it must be 
possible to reproduce (practically) the 'same' conditions of preparation and 
to carry out operations which can be identified as (practically) the 'same' 
measurements, so as to be able to determine that different preparations tend 
to lead to different measurement outcomes. Thus, the theoretical modelling 
of a physical system should be based on the structures of a set S of states, 
which represent the different possible, reproducible preparations, and a set O 
of observables, which represent the different possible measurements. A state 
and an observable should together determine a probability distribution that 
corresponds to the expected frequencies of the outcomes of the same (class of) 
measurement, repeated many times on a system (or a collection of independent 
systems) prepared in the same state. 

These ideas entail some basic properties of the sets S and O which we 
will spell out next. The possibility of randomly choosing different states, with 
fixed probability weights, and subjecting them to independent runs of the same 
measurement implies that the set of states should be convex, thus including 
all mixtures of any finite or (by extrapolation) countable subsets. Moreover it 
follows that each measurement outcome, labelled by a value a k (k — 1, 2, . . . ), 
say, gives rise to an affine map 

E fc : S -» [0, 1] , p i-f E k (p) = p p ( 0fc ) (1) 

where p p (a k ) is the probability of the outcome a k in the state p £ S. We can 
ensure that every measurement is certain to produce an outcome by allowing 
for a null event, a , to represent the situation where there is no response. 
It follows that the sum of probabilities ^2 k p P {a k ) = 1 f° r all states p. We 
summarize this by writing J2 k ^ k = ^ where / : p >— ► 1 is the constant unit 
map that sends every state to the number 1. The totality of all maps E k 
defines the observable measured in the experiment under consideration. 

This consideration generalizes as follows. The set of states of a given type 
of physical system is modelled as a convex set S. The set O of observables is 
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given by a set of affine maps 

A:S^M(n)+ , p»A{p)=tf (2) 

from S to the convex set M of probability measures over some measurable 
spaces (f2, £), where fl denotes the set of measurement outcomes and £ a a- 
algebra of subsets of appropriate to each measurement represented in the 
model. 

Each observable A, together with a subset IeE, gives rise to an affine 
map 

£7 (X): 5 -[0,1], p^E{X){p)=:A{p){X) = E p {X). (3) 

Any affine map a : S — > [0, 1] is called an effect. The map 

S3Ih£(I) (4) 

has the properties of a measure, with normalization E (O) = /, the property 
E (0) = O (=thc constant zero map which is itself an effect), and c-additivity 
following from that of all the probability measures E p . We see that any ob- 
servable A gives rise to an effect valued measure E = E A . Conversely, every 
effect valued measure E on some measurable space (f2, S) induces an observ- 
able A = A E as an affine map from S to M (0)^~. We have thus, in effect, two 
equivalent definitions of an observable. 

Given the set O of observables, we can define the set of all physically 
realizable effects, denoted £, as the union of the ranges of all the effect valued 
measures associated with the observables in O. This set £ is thus a subset 
of the set £ (S) of all effects on S. The set £ inherits a natural partial order 
defined in £ (S) as follows: 

a < b if and only if a(p) < b (p) for all p e S. (5) 

We can thus write £ (S) = [0,1]. The set £ represents the collection of all 
simple observables, that is, those with only two outcomes (also called yes- no 
observables): for a e £, we also have a' := I — a e £, and the map {1} i— > a, 
{0} i ^ a' defines an effect valued measure on (the power set of) fi = {0, 1}. We 
can take O to be closed under all coarse-grainings of its observables (understood 
as restrictions of the associated effect valued measures on (17, S) to sub-<7- 
algebras of S. 
The map 

£ (S) 3 a ^ a' = I - a e £ (S) (6) 
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defines a kind of complement, or negation, and one can consider a definition 
of weak orthogonality of effects: a ± b if a + b < I. This relation is not an 
orthocomplement since one can have pairs of effects satisfying a _L b while both 
have a common nonzero lower bound, O < c < a,b. With the partial operation 

£ 3 a, b^ a© 6 := a + b if a + b E £, (7) 

the set of effects £ assumes the structure of an effect algebra. The connection 
between effect algebras and statistical models has been thoroughly analyzed 
by Beltrametti and Bugajski El and independently by Gudder. 1^ 

We will make the assumption that the set of physical effects £ separates 
S, that is: for any two different p, p' 6 S, there is an effect a 6 £ such that 
a(p) ^ a (//). This is a natural assumption and can be satisfied by identifying 
all states that would otherwise be indistinguishable. We then summarize these 
consideration by defining a statistical model as a pair (S, £) consisting of a 
convex set of states and a separating set of physical effects. 

It is convenient to consider convex structures as embedded into their nat- 
ural linear extensions. As a convex set, S is part of a vector space V = V (S) 
over the field of real numbers which we shall consider to be the span of S. 
The set of effects £ (S) is a convex subset of the vector space A = Ab {S) of all 
(real-valued) bounded affine functional of S. Let A* denote the algebraic dual 
space to A, that is, the set of linear functionals on A. Then, for p 6 V, the 
map a i— > a (p) is a linear functional on A, which implies that V is injectively 
embedded into A* . Thus we can introduce a (nondegenerate) bilinear form on 
,4* x A, (•, •), such that for p e V, 

(p,a) = p(a) =a(p) . (8) 

With the vector space W = W (£) generated as the span of £ , the pair (V, W), 
equipped with the nondegenerate form (-|-), is referred to as a dual pair. Every 
statistical model can thus be embedded in a dual pair of vector spaces. 

The condition that £ separates S ensures that the vector space V can be 
equipped with a norm whose unit ball is B :— conv (S U — <S); this norm, called 
the base norm, is the Minkowski functional of B, defined via 

m(/ > ):=inf{A:p€AB}=:||p|| 1 . (9) 

In this way, V is a base norm space. Since bounded affine functionals on S 
extend uniquely to bounded linear functionals on V, Ab {S) is a subspace of 
the dual space V* of V. The set [0,1] — £ (S) C Ab (S) is called order unit 
interval, and it makes Ab (S) an order unit space with the order unit norm 
equal to the dual space norm inherited from V* and the norm unit ball given 
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by [—/,/]. If the normed vector spaces V, W are complete, the pair (V, W) is 
called a statistical duality. A lucid introduction to the se s tructures and their 
physical context can be found in the work of W. Stulpe f or further details, 
cf. also Ref. ^ and references therein. 

3 Classical and quantum statistical models 

The traditional classical statistical model is determined by a measurable space 
(T,£>(r)), where T is the state space of a dynamical system and B (T) is a 
CT-algebra of subsets of T. Usually, T is a topological or metric space, e.g. a 
(subspace of) M™, and B (T) will then be the associated Borel algebra. The 
set of (statistical) states S c is usually taken to be the set of all measures 
M± (r, B(r)) or sometimes the set of (measures with) probability densities 
with respect to some reference measure, L 1 (T,dm)t, such as L 1 (R ra ,<i n x)^. 
Following Beltrametti and Bugajski El we define the set of measurable effects 
£ c as the collection of affine functionals on S c , determined by some measurable 
function /:r-> [0,1]: 

a f : S c -> [0, 1] , fj, i-> a/ (/x) = J f dfx. (10) 

The order relation af < a g is now equivalent to / < g in the case S c = 
(r, B(r)) [take fi — <5 7 , the Dirac measure supported at 7 £ 1] or / < g 
almost everywhere with respect to the measure m for iS c = L 1 (T, dm)^ . 

An observable is an affine map A : S c — > M (CI, S)^ from the given set 
of classical states to the set of probability measures on a measurable space 
(f2, S) of measurement outcomes. This comprises the traditional definition of 
a classical observable as a function on phase space (random variable). Let 
F : r — > Cl be a measurable function, then we define an affine map Ap : S c — -> 
M (O, S)^ as follows: 

A Fi i (X) := /i (F- 1 (A)) = y <5 F(7) (A) dp (7) , A e S . (11) 

The associated effect valued measure is given by 

Ep^F- 1 :E^B(r) , A^F- 1 (A)-a Xj? _ 1(x) . (12) 

As a function on T, this corresponds to the indicator function Xf- 1 (x)- (Note 
that Xf~ 1 (X) (7) — ^FM (^)j hence the above integrand is a measurable and 
indeed integrable function.) 
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This construction of the standard classical observable extends to the class 
of observables with measurable effects as follows. The function (7, X) 
Xf~ 1 (x) (7) is a particular instance of a Markov kernel 

K:TxZ^[0,l] , ( 7 ,X)~K( 7 ,X), (13) 

that is, a map with the property that K (7, ■) e M (CI, E)^~ and K (■, X) is a 
measurable function, in fact, an effect. Then the following defines an observable 
Ak which represents a fuzzy random variable: 

AkV (X) := J^K (7, X) dix (7) , X e £ , (14) 

with associated effect valued measure 

E K :Y,^£ C , X i-> Or- ( . ; x) • (15) 

A quantum statistical model is usually based on the set of states S q = 
S (TL) given by the density operators p of a separable complex Hilbert space 
TL. Conventional quantum mechanics takes its set of physical effects as 
the set of all orthogonal projection operators P, P = P* = P 2 . Operational 
quantum physics is based on the set of all effects £ q — £ (H) = [0,1], given 
by the operators a for which O < a < I. In both cases, projections and effects 
define affinc functionals on S q via the trace formula, a (p) — tr [p ■ a] . Both 
sets £? and £ q are effect algebras, with the set of projections being the set of 
extreme elements of the convex set of effects. £? is an orthocomplemented non- 
Boolean lattice under the ordering a < b and complementation map a ^ a' 
inherited from £ q . 

A characteristic difference between quantum and classical statistical mod- 
els is given by the fact that S c is a simplex (all states have a unique repre- 
sentation as a convex combination (finite, countable or continuous) in terms 
of the extreme elements of S c , while all mixed quantum states allow many 
decompositions into pure states. We illustrate this state of affairs by means of 
the smallest nontrivial example of a quantum statistical model associated with 
the Hilbert space H = C 2 . This will be compared with the classical statistical 
model based on a set O = {1, 2, 3, 4}. 

The classical statistical model for fi 4 = {1,2,3,4}, with S = V (Q 4 ) (the 
power set of O4), is given by S (O4) C K 4 = V (O4), where 

S (0 4 ) = {P = ipi,P2,P3,Pl) -Pk>0, Pi +P2+P3+P4 = 1} • (16) 

This is a tetrahedron with vertices (1,0, 0, 0) , (0, 1, 0, 0) , (0, 0, 1, 0) , (0, 0, 0, 1). 
The set of effects £ comprises all the maps 

a : p >-> a(p) = a {S^PkCk) = ^^Pka{e k ) = ^p fe a fe (17) 
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for which a (p) G [0,1]. [Here the e k denote the Cartesian unit basis vectors, 
and a/c := a(efe).] This condition entails that < a k < 1. Hence the set of 
effects can be identified as a subset of M 4 = W (ft 4): 

£ (0 4 ) = {a = (0,1,02,0,3, 04) : < a k < 1} . (18) 

This is a hypercube with the vertices given by the 16 points whose coordinates 
are quadruples of 0s and Is. The bilinear form associated with the dual pair 
(V (f^) , W (O4)} is the Euclidean inner product, 

V (0 4 ) x W (0 4 ) 9 (p, a) 1 > (p, a) = XI P fe ° fe = P ' a • ( 19 ) 

The quantum statistical model for H — C 2 (a spin-l system or generally 
a qubit) can be parameterized in the Cayley representation as follows. As a 
basis of the space of 2x2 matrices (linear operators of C 2 ) we take 




where the a k are the Pauli matrices. Every density operator can be written 
as p = 1 (r I + r ■ a) [where r = (n, r 2 , r 3 ) G M. 3 , a = (ai, a 2 , 03)] and is thus 
associated with a vector p = (tq, r) G K 4 . The fact that p assigns probability 
one to the effect I entails that tr[p] = r = 1. The condition that p> O can be 
expressed in terms of the eigenvalues of p: h (1 ± |r|) > 0, which is equivalent to 
the Euclidean norm |r| < 1. The set S (C 2 J is thus isomorphically represented 
by a subset § (C 2 ) ofK 4 , 

S (C 2 ) = {p:ti\p] = l,p>0}~S (C 2 ) ={p= (1, r) : |r| < 1} . (21) 

S (C 2 ) is a 3-dimensional unit ball embedded in the hyperplane ro = 1. The 
effects of C 2 arc the operators (matrices) a = \ (oqI + a • a) satisfying the 
condition O < a < I, which is equivalent to < \ (a ± |a|) < 1. Thus £ (C 2 ) 
is isomorphic to a subset £ (C 2 ) of R 4 , 

£(C 2 ) ={a:0<a< 1} ^ £ (C 2 ) = {a = (a ,a) : \ (a ± |a|) < 1} . (22) 

£ (C 2 ) (or rather £ (C 2 )) can be visualized as a convex diamond-shaped figure 
that is the convex hull of the 'top' and 'bottom' vertices given by I <-> (2,0), 
O <-> (0, 0) and the surface of the sphere {(1, a) : |a| = 1}. 

The bilinear form associated with the dual pair of vector spaces which host 
S (C 2 ), £ (C 2 ) is given as follows: 

(p, a) = tr [p ■ a] = \ (a + a • r) = \a ■ p (23) 
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4 Embeddings and extensions of statistical models 

We consider next the various possible ways of embedding or extending one 
statistical model (S,£) (in)to another, (S',£'}, in a 'natural' way. That is, 
we are asking for an association of states p and effects a of (S, £) with states 
p' and effects a 1 of (S',£'), p,a <-> p',a', which is such that probabilities are 
preserved: (p, a) = (p',a'}; further we require that the convex structures are 
respected by the sought correspondence. 

We shall investigate two obvious choices which ensure that all states of S 
will be represented: 

(A) There is an injective affine map $ : S — > S'; this embedding map should 
give a faithful representation of the states of S in a possibly 'larger' state 
space.. 

(B) There is a surjective affine map R : S' — > S; this reduction map would 
describe the system represented by S, possibly as a part of subsystem of 
a larger, compound system. 

In fact, both maps could exist simultaneously, being inverses to each other. 
In this case the two state spaces are isomorphic. In both cases (A) and (B) we 
will have to determine whether all effects of £ can indeed be represented by 
effects of £', as envisaged. 

Assume an affine map $ : S — > S' is given. This extends uniquely to a 
linear map from V (S) to V (S 1 ), which we will also denote $. This induces the 
dual map $* : V (5')* — > V (<S)*, defined by the condition 

(*p,o') = (p,*V>, i.e., $V = o' o$, peV(S) , a' eV(S')* . (24) 

sends £ (S') into £ (S): indeed, for a' e £(S'), we have ($V) (p) = 
a' ($p) e [0, 1] for all p E S, that is, $*a' € £ (<S). The requirement of injec- 
tivity of $ implies that the range of is (weak-*) dense in V {S) . However, 
this does not imply that $* {£ {S')) is dense in £{S). We will say that an 
injective affine map $ : S — > 5' induces a 'good' embedding of the model (5, £} 
within (S',£') if <&* (£') is at least dense in £, so that every physical effect of 
the original model can be described by a physical effect of the extended model. 
We shall see that there is no 'good' classical embedding of the full quantum 
statistical model. 

Now consider the case where there exists an affine, surjective reduction 
map R : S' — > S. This having a surjective extension to V (S r ), it follows that 
the dual map R* provides an injective representation of all effects in £ (S) as 
effects in £ (S') via R*a = aoR. The map R is thus seen to be a 'good' extension 
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of the statistical duality (S,£) to (S',£ r ) (provided it exists). We will review 
the Misra-Bugajski map as an example of a 'good' classical extension of the 
quantum statistical model. 

5 Embeddings and extensions of quantum statistical models 

We give an overview of the known types of embeddings and extensions of 
quantum statistical models, including the classical representations. We begin 
with an 'unsuccessful' attempt, one which nevertheless has provided important 
insights into the structure of quantum mechanics. 

5.1 Wigner function - a counter example 

The question of classical representations of quantum mechanics is as old as 
quantum mechanics itself: already in Born's famous paper of 1926 introducing 
the probability interpretation El the question of extensions of quantum me- 
chanics in the framework of a classical theory with hidden variables was raised. 
A few years later, Wigner EH attempted to establish a phase space formulation 
of quantum mechanics and found an injective (in fact isometric) affine map 
W : S q — > L 1 (r, dq dp) of the density operators p of a quantum particle to in- 
tegrable and normalized functions of phase space. This map yields the correct 
marginal position and momentum distributions associated with each p, but the 
functions Wp (q,p) are not nonnegative (apart from a 'few' exceptions such as 
Gaussian wave functions). That is to say that the Wigner map W does not 
map into L 1 (T^dqdp)^, the probability densities on phase space. This rules 
out W as a classical embedding of quantum mechanics in the sense defined 
here. a 

5.2 Classical embedding induced by an observable 

However, we have already seen examples of candidates for classical embed- 
dings, namely, in the form of the affine maps A : S q — > M± (£7, S) which define 
the observables of the quantum statistical model. The dual map sends classi- 
cal effects to quantum effects; in particular it takes the 'crisp' (sharp) effects 
represented by characteristic functions to the effects in the range of the POVM 
E A associated with A: 

Sjlni' (a Xx ) = E A (X) E E q . (25) 

^See also the penetrating remarks of S. Bugajski El 
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Among the quantum observables A there are injective ones, and the associated 
effect vaiued measures (or POVMs) are called informationally complete. 13 If E A 
is informationally complete, the dual map A* sends the dual space of M (CI, E) 
onto a dense subspace of the space of bounded selfadjoint operators (the dual 
of the space of selfadjoint trace class operators hosting S q ). However, the set 
of measurable classical effects which arises as the convex hull of all a Xx can 
never exhaust, or be dense in, the set of quantum effects £ q : this is due to the 
fact that for an informationally complete POVM, the effects E A (X) cannot be 
projections other than O or I.° This means that an informationally complete 
observable A does not induce a 'good' classical embedding of the full quantum 
statistical model (S,£) but only of a reduced quantum model (<S, £ (A)}, with 
£ replaced by £ [A) — E A (E), the separating effect algebra consisting of the 
range (or the convex hull of the range) of the POVM E A . 

An example of an informationally complete observable is given by the co- 
herent state based phase space POVM, which corresponds to the Husimi dis- 
tribution functions associated with each quantum state. These are bona fide 
phase space probability densities but the price to be paid is that its marginal 
position and momentum distributions are convolutions of the standard quan- 
tum mechanical position and momentum distributions with Gaussian confi- 
dence distributions, thus ensuring that appropriate Heisenberg uncertainty re- 
lations for the inaccuracies are satisfied. Wigner's theorem can thus be inter- 
preted as a demonstration of the incompatibility of the standard position and 
momentum observables, while the existence of the Husimi and other, similar 
phase space distributions shows that joint measurements of position and mo- 
mentum are possible if an allowance is made for unsharpness in line with the 
uncertainty relation. d 

Informationally complete observables for Tt = C 2 are easily c onstructed ^4 
together with simple, realizable measurement schemes. 111 Geometrically, 
they can be represented as affine embeddings of the Poincare sphere S (C 2 ) 
into the set M± (f2, E). With the choice = {1, 2, 3, 4} one can ensure that 
the embedding is not only injective but that its linear extension is surjective. 
In this case the sphere S (C 2 ) is mapped onto an ellipsoid which is embedded 



b The notion of informationally complete observables and first examples are due to work of 
the late E. Prugovecki from the 1970s. Examples and a survey of the early literature on this 



concept can be found in EH 



C A proof of this fact was given by Bu sch and Lahti E3 Intuitive demonstrations for the 
finite dimensional case can be found in Efl 

d A detailed account of phase space observables, joint measurements of position and momen- 
tum, th e ro le of the Heisenberg uncertainty relation, and a survey of relevant literature is 
given in E3 
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into the tetrahedron S (O4). The dual map sends the hypercube £ (f^) into 
a 'stretched' hypercube inside the 'diamond' £ (C 2 ) , in such a way that the 
elements (1,1,1,1) and (0,0,0,0) [which represent the / and O effects] are 
mapped to (2,0) and (0,0), respectively. This description makes it evident 
that the extreme points of the set of quantum effects [other than / and O] 
cannot be represented in terms of classical effects, except, perhaps, in finitely 
many cases where some extreme points of the hypercube touch the extreme 
boundary of the diamond. 

5.3 Cayley representation of the C 2 statistical duality 

The association p < — > (l,r), a < — ► (ao> a ) reviewed above defines a bijective 
affine mapping $ : S (C 2 ) — ► <S(C 2 ). Likewise, the dual map $* : £ (C 2 ) — > 
£ (C 2 ) is an affine bijection. Hence we have both a 'good' embedding (via <&) 
and a 'good' extension (via <j> _1 ) of (S (C 2 ) , £ (C 2 )}. In fact the linear exten- 
sions of these maps are isometries with respect to the trace norm \\a\\ 1 =tr[|a|] 
and the norm ||(ao,a)|| = maxjao, |a|}. 

5.4 Gleason's theorem (and a simple variant) 

The essence of standard quantum mechanics is captured in the standard quan- 
tum statistical model (S q ,£Jj). Let v : £? — > [0, 1] be a generalized probability 
measure on the lattice of projections £jj, that is, a map which satisfies the 
conditions v (O) = 0, v (I) = 1, and v (J2k Pk) — J2k v (^k) f° r anv finite or 
countable set of pairwise orthogonal P^ £ Let denote the set of all such 
generalized probability measures. This is a convex set, and each projection 
P E £? defines an effect ap on via v 1— ► ap (v) = v{P). These effects 
separate since v (P) = v' (P) for all P implies v = v' . Hence (S?,£p) is 
a statistical model. Gleason's theorem asserts that if the dimension of the 
underlying Hilbert space is greater than 2, then for every v £ S%, there is 
a unique p £ S q such that v (P) = tr [p ■ P] for all P £ £?. This associa- 
tion i?G • v 1 — ► p = p v is bijective and thus affine. Hence it induces a 'good' 
embedding and extension of (S q ,£^) in terms of {Sp,£p). 

A similar but much simpler result arises for the statistical model (S q , £ q ) of 
operational quantum mechanics if we define S q to be the set of all generalized 
probability measures on the full set of quantum effects £ q . The set £ q comprises 
enough elements so as to include basis systems of effects afe such that ^ fc au is 
an effect. This readily entails that every generalized probability measure v on 
£ q extends to a bounded linear functional on the space of bounded selfadjoint 
operators. The er-additivity of v implies that this functional arises from a 
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density operator via the trace formula, v (a) = tr [p ■ a] . Hence we have a 
'good' embedding and extension of (S q ,S q ) to a statistical model (S q ,£ q ). c 
This time there is no restriction on the dimension of the Hilbert space. The 
generalized probability measures on the set of effects restrict to generalized 
probability measures on the projections, which entails that S q is a proper 
subset of iS^ in the case TC — C 2 while S q — in the case dimTC > 2. 

5.5 Compound systems extension 

Let TC — TC ® TC' be the tensor product of the Hilbert space TC of the system 
of interest with an auxiliary Hilbert space TC' . Denote by R the partial trace 
map from the trace class of TC onto the trace class of TC. This is a linear 
map, and its restriction to S(Tt) has as its range all of S (TC). The dual map 
R* : a t—t a ® / is a linear injection or all bounded operators of TC into the 
space of bounded linear selfadjoint operators of TC, with the property O < a < I 
< R* (a) < R* (I) = In this way the statistical model (S(TC), £(TC)) 

arises as a 'good' extension of the statistical model (S (TC) , £ {TC)). 

5.6 Canonical classical extension of quantum mechanics 

We now assume that we are given a 'good' classical extension induced by a 
surjective affine map R : (fi, S) — > S (TC). We will make a few assumptions 
on (£1, E) and take a few steps to eliminate 'redundancies'. This will lead us 
in a fairly natural way to exhibit the canonical classical extension introduced 
by Misra and recognized by Bugajski as a 'good' extension of the maximal 
quantum statistical model into a distinguished classical statistical model. The 
assumptions on (Q, S) are: 

(a) E contains all singletons {w}, u) € fi; 

(b) the extreme elements of (fi, E) are exactly the Dirac measures 8 Ul so 
that each /i £ (fi, E) can be uniquely written as 

jtx = / 5 W d\i (uj) . (26) 
Jn 

Consider any density operator p corresponding to a pure state, p — P v = 
\ip)((f\. Let p, £ M^(Q,E) be any measure for which Rp = p. It follows that 
Rfioj — P for all which occur in the convex decomposition (|26|l of /i, and 
every pure state will then be an image under R of some <5 W . We will consider 
f2 as restricted to those lo for which RS^ is pure. [Note that this presupposes 

e This fact is proved and discussed for quantum mechanics in E3 It has been proved in a 
much more abstract context by Beltrametti and Bugajski E| 
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that {u> € f2 : RS U is pure} 6 E. We will see presently that this can be trivially 
guaranteed.] Further we can identify (as equivalent) all to, to' £ f2 for which 
RS U = RSu', and replace ft with the set (also denoted Q) of all equivalence 
classes [ui], E with the cr-algebra (again denoted E) of sets [X] :— {[u] : to E X} 
and R with the induced map, also called i?, that sends 5r w i to RS^,. In this 
way we identify f2 with Ex [5 (7~£)], the set of pure density operators. 

Now we let f2 = Ex [S (H)] and to denote any pure state in S (H). The 
affine reduction map R must then satisfy 



R : [i — / Sujdfi (uj) i— » / R5u d/i (uj) = I wd(i (uj) = p M . (27) 
Jn Jn Jn 

The nondegenerate bilinear forms induced by the classical and quantum sta- 
tistical models involved is given by 



(p M) a) = tr \pn ■ a] = / tr [to ■ a] d/j, (u) = (fx, f a ) ■ (28) 
Jn 

This determines the dual map 

R* : a ^ f a , /„ (w) = tr [wo]. (29) 

For this formula to make precise sense, it is necessary that E is fixed in such 
a way that the functions f a are measurable. Misra *^ has shown how to 
achieve this. Thus every quantum effect a G £ (H) is represented as a classical 
measurable effect f a . Hence R induces a 'good' classical extension of the 
quantum statistical duality. f 

We can see from Eq. (|29H that all quantum effects, including the 'sharp' 
or 'crisp' ones given by projections, are represented by functions whose val- 
ues vary continuously between their maximum and minimum values. For any 
projection a not equal to / or O, f a (uj) assumes all values in [0,1]. Hence 
the projections are represented as classical fuzzy sets. All quantum observ- 
able, represented by POVMs E, including the PVMs, are described under the 
map R* as classical fuzzy random variables with associated Markov kernel 
K E (o),J)=tr[wS(I)]. 

The many-to-one relationship between the probability measures \x and the 
quantum states = Rfi reflects exactly the infinitely many ways in which 
every density operator which is not a pure state can be written as a convex 
combination of pure states. The set of all classical effect valued measures 

'The Misra-Bugajski map is an instance of a much more general mathematical result of 
Choquet on boundary measures of compact convex sets in the context of locally convex 
vector spaces. See Theorem I.. 4. 8 of Alfsen 
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R* o E represents all quantum mechanical measurements as classical fuzzy 
measurements which even taken together are too imprecise to separate the 
various different probability measures //,//,... which lead to the same density 
operator p = Rp = Rp' = . . . . 

6 Concluding remarks 

In this contribution I have reviewed the concept of statistical model and the 
various possible relations of embeddings and extensions between statistical 
models. The possible ways of constructing 'good' classical representations of 
either the full or some restricted quantum statistical models have been ex- 
hibited. The only 'good' classical extension of the full quantum statistical 
model is uniquely given (modulo redundancies) by the Misra-Bugajski map. 
This canonical classical extension of quantum mechanics does not constitute a 
hidden- variable completion of quantum mechanics because it does not render 
the sharp quantum properties (projections) as dispersion-free in the extremal 
classical states. On the contrary, under this classical extension all quantum 
effects - whether unsharp or sharp - are represented as classical fuzzy sets. 

While the classical embeddings of quantum mechanical statistical models 
via effect valued measures (which may or may not be informationally complete) 
have been extensively studied and led to a plethora of applications , much 
remains to be done to explore and exhaust the full potential of the canonical 
classical embeddings. In their very last joint work, Beltrametti and Bugajski 
made some tentative steps to characterize quantum correlations as opposed 
to classical correlations in this framework. The 'Bell phenomenon in classical 
framework', that is, the violation of Bell's inequalities for fuzzy random vari- 
ables corresponding to EPR-Bell observables, calls for an investigation of the 
notion of coexistence (joint measurability) of fuzzy random variables. More 
generally, it would be desirable to cast all the distinctive quantum structures, 
such as non-commutativity, complementarity, uncertainty relations, entangle- 
ment, in the language of the classical canonical extension. This should en- 
able us to relate these quantum features to concepts closer to our 'classical' 
experience and intuition - in other words: it should contribute to a better 
understanding of quantum mechanics. 
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